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Abstract 

Statistical mechanics is one of the most powerful and elegant tools in the quantita- 
tive sciences. One key virtue of statistical mechanics is that it is designed to examine 
large systems with many interacting degrees of freedom, providing a clue that it might 
have some bearing on the analysis of the molecules of living matter. As a result of 
data on biological systems becoming increasingly quantitative, there is a concomitant 
demand that the models set forth to describe biological systems be themselves quan- 
titative. We describe how statistical mechanics is part of the quantitative toolkit that 
is needed to respond to such data. The power of statistical mechanics is not limited 
to traditional physical and chemical problems and there are a host of interesting ways 
in which these ideas can be applied in biology. This article reports on our efforts to 
teach statistical mechanics to life science students and provides a framework for others 
interested in bringing these tools to a nontraditional audience in the life sciences. 

1 Does Statistical Mechanics Matter in Biology? 

The use of the ideas of equilibrium thermodynamics and statistical mechanics to study 
biological systems arc nearly as old as these disciplines themselves. Whether thinking about 
the binding constants of transcription factors for their target DNA or proteins on HIV 
virions for their target cell receptors, often the first discussion of a given problem involves 
a hidden assumption of equilibrium. There are two key imperatives for students of the life 
sciences who wish to explore the quantitative underpinnings of their discipline: i) to have a 
sense of when the equilibrium perspective is a reasonable approximation and ii) given those 
cases when it is reasonable, to know how to use the key tools of the calculus of equilibrium. 
Our experiences in teaching both undergraduate and graduate students in the life sciences 
as well as in participating both as students and instructors in the Physiology Course at the 
Marine Biological Laboratory in Woods Hole drive home the need for a useful introduction 
to statistical mechanics for life scientists. 

This paper is a reflection of our attempts to find a minimalistic way of introducing 
statistical mechanics in the biological setting that starts attacking biological problems that 
students might care about as early as possible. We view this as part of a growing list of 
examples where quantitative approaches are included in the life sciences curriculum [1, 2, 
|3]. As will be seen throughout the paper, one of the key components of this approach is 
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to develop cartoons that provide a linkage between familiar biological concepts and their 
mathematical incarnation. The courses we teach often involve a very diverse mixture of 
students interested in how quantitative approaches from physics might be useful for thinking 
about living matter. On the one hand, we have biology students that want to make the 
investment to learn tools from physics. At the same time, about one third of our students are 
from that ever-growing category of physics students who are excited about taking what they 
know and using it to study living organisms. As a result, we face a constant struggle to not 
lose either the interest or understanding of one of these two constituencies. The challenge is 
to be interdisciplinary while maintaining strong contact with the core disciplines themselves. 

One of the key questions that must be addressed at the outset has to do with the question 
of when equilibrium ideas are a viable approach in thinking about real biological problems. 
Indeed, given the fact that biological systems are in a constant state of flux, it is reasonable to 
wonder whether equilibrium ideas are ever applicable. Nevertheless, there are a surprisingly 
large number of instances when the time scales conspire to make the equilibrium approach 
a reasonable starting point, even for examining some processes in living cells. To that end, 
we argue that the legitimacy of the equilibrium approach often centers on the question of 
relative time scales. To be concrete, consider several reactions linked together in a chain 
such as 

a^bAc. (1) 

fc_ 

For simplicity, we consider a set of reactions in which the terminal reaction is nearly irre- 
versible. The thrust of our argument is that even though the conversion of B to C is bleeding 

off material from the A and B reaction, if the rate of B to C conversion is sufficiently slow 

fe + 

compared to the back reaction B —* A, then the A <^ B reaction will always behave in- 

fc_ 

stantaneously as though it is in equilibrium. There are a range of similar examples that 
illustrate the way in which important biological problems, when boxed off appropriately, 
can be treated from the equilibrium perspective [1]. The goal of this paper is to use simple 
model problems to illustrate how equilibrium ideas can be exploited to examine biologically 
interesting case studies. 

2 Boltzmann, Gibbs and the Calculus of Equilibrium 

We find that statistical mechanics can be introduced in a streamlined fashion by proceeding 
axiomatically. We start by introducing a few key definitions and then arguing that just 
as classical mechanics can be built exclusively around repeated uses of F = ma, statistical 
mechanics has its own fundamental law (the Boltzmann distribution) from which results 
flow almost effortlessly and seemingly endlessly. There is a great deal of precedent for this 
axiomatic approach as evidenced by several amusing comments from well known statistical 
mechanics texts. In the preface to his book [5], Daniel Mattis comments on his thinking 
about what classes to take on statistical mechanics upon his arrival at graduate school. 
"I asked my classmate JR Schrieffer, who presciently had enrolled in that class, whether 
I should chance it later with a different instructor. He said not to bother - that he could 
explain all I needed to know about this topic over lunch. On a paper napkin, Bob wrote 
e -/3ff "That's it in a nutshell". "Surely you must be kidding Mr. Schrieffer" I replied (or 
words to that effect) "How could you get the Fermi-Dirac distribution out of THAT?" "Easy 
as pie" was the reply . . . and I was hooked" . 

Similarly, in speaking of the Boltzmann distribution, Feynman notes in the opening 
volley of his statistical mechanics book: "This fundamental law is the summit of statistical 
mechanics, and the entire subject is either the slide-down from this summit, as the principle 
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a) b) c) d) 




Figure 1: Microstates of DNA in solution, (a) Fluorescence microscopy images of A-phage 
DNA (7] (reprinted with permission). The DNA molecule jiggles around in solution and 
every configuration corresponds to a different microstate. (b) The film strip shows how 
at every instant at which a picture is taken, the DNA configuration is different. From a 
mathematical perspective, we can represent the configuration of the molecule either by using 
(c) a discrete set of vectors {r^} or (d) by the continuous function r(s). 



is applied to various cases, or the climb-up to where the fundamental law is derived..." [BJ. 
Our sense is that in a first exposure to statistical mechanics for students of the life sciences, 
an emphasis on the slide-down from the summit which illustrates the intriguing applications 
of statistical mechanics is of much greater use than paining through the climb to that summit 
with a consideration of the nuances associated with where these distributions come from. 
As a result, we relegate a derivation of the Boltzmann distribution to the appendix at the 
end of this paper. 

So what is this "summit" that Feynman speaks of? Complex, many-particle systems such 
as the template DNA, nucleotides, primers and enzymes that make up a polymerase chain 
reaction, familiar to every biology student, can exist in an astronomical number of different 
states. It is the job of statistical mechanics to assign probabilities to all of these different 
ways (the distinct "microstates" ) of arranging the system. The summit that Feynman speaks 
of is the simple idea that each of these different arrangements has a probability proportional 
to e~ l3Ei , where 2% is the energy of the microstate of interest which is labeled by the index i. 
To make this seem less abstract, we begin our analysis of statistical mechanics by describing 
the notion of a microstate in a way that will seem familiar to biologists. 

One of our favorite examples for introducing the concept of a microstate is to consider 
a piece of DNA from the bacterial virus known as A-phagc. If one of these ss 48,500 base 
pair long DNA molecules is fluorescently labeled and observed through a microscope as it 
jiggles around in solution, we argue that the different conformations adopted by the molecule 
correspond to its different allowed microstates. Of course, for this idea to be more than just 
words, we have to invoke some mathematical way to represent these different microstates. 
As shown in fig. [T] it is possible to characterize the states of a polymer such as DNA either 
discretely (by providing the x, y, z coordinates of a set of discrete points on the polymer) or 
continuously (by providing the position of each point on the polymer, r(s), as a function of 
the distance s along the polymer). 

A second way in which the notion of a microstate can be introduced that is biologi- 
cally familiar is by discussing ligand-receptor interactions (and binding interactions more 
generally). This topic is immensely important in biology as can be seen by appealing to 
problems such as antibody-antigen binding, ligand-gated ion channels and oxygen binding 
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to hemoglobin, for example. Indeed, Paul Ehrlich made his views on the importance of 
ligand-receptor binding evident through the precept: "Corpora non agunt nisi ligata - A 
substance is not effective unless it is linked to another" [8]. Whether discussing signaling, 
gene regulation or metabolism, biological action is a concert of different binding reactions 
and we view an introduction to the biological uses of binding and how to think about such 
binding using statistical mechanics as a worthy investment for biology and physics students 
alike. 

To treat cases like these, it is convenient to imagine an isolated system represented by a 
box of solution which contains a single receptor and L ligands. One of the pleasures of using 
this example is that it emphasizes the simplifications that physicists love In particular, 
as shown in fig. [2] we introduce a lattice model of the solution in which the box is divided 
up into f2 sub-boxes. These sub-boxes have molecular dimensions and can be occupied by 
only one ligand molecule at a time. We also assume that the concentration of ligand is 
so low that they do not interact. In this case, the different microstates correspond to the 
different ways of arranging the L ligands amongst these f2 elementary boxes. Although it 
is natural for biological scientists who are accustomed to considering continuous functions 
and concentrations to chafe against the discretization of a solution in a lattice model, it is 
fairly easy to justify this simplification. We may choose any number of boxes fi, of any size. 
At the limit of a large number of very small boxes, they may have molecular dimensions. In 
practice, the mathematical results are essentially the same for most choices where ^> L. 
that is, where the solution is dilute. Given L ligands and f2 sites that they can be distributed 
on, the total number of microstates available to the system (when no ligands are bound to 
the receptor) is 

number of microstates = — r— rr. (2) 

L . ( — Lj ! 

The way to see this result is to notice that for the first ligand, we have distinct possible 
places that we can put the ligand. For the second ligand, we have only Q — 1 choices 
and so on. However, once we have placed those ligands, we can interchange them in L\ 
different ways without changing the actual microstate. In fact, these issues of rearrangement 
(distinguishability vs. indistinguishability) are subtle, but unimportant for our purposes 
since they don't change the ultimate results for classical systems such as the biological 
problems addressed here. We leave it as an exercise for the reader to show that the results 
in this paper would remain unaltered if considering the ligands to be distinguishable. 

During our participation in the MBL Physiology Course in the summer of 2006, our 
favorite presentation of this notion of microstate was introduced by a PhD biology student 
from Rockefeller University. During her presentation, she used a box from which she cut 
out a square in the bottom. She then taped a transparency onto the bottom of the box and 
drew a square grid precisely like the one shown in fig. [2] and then constructed a bunch of 
"molecules" that just fit into the squares. Finally, she put this box on an overhead projector 
and shook it, repeatedly demonstrating the different microstates available to "ligands in 
solution" . 

These different examples of microstates also permit us to give a cursory introduction 
to the statistical mechanical definition of entropy. In particular, we introduce entropy as a 
measure of microscopic degeneracy through the expression 

S{V,N) = k B ln W(V,N), (3) 

where k B — 1.38 x 10~ 23 J/K the all-important constant of statistical mechanics known 
as the Boltzmann constant, S(V,N) is the entropy of an isolated system with volume V 
containing N particles, and W(V, N) is the number of distinct microstates available to that 
system. We also argue that the existence of the entropy function permits the introduction 
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Figure 2: Lattice model for solution. Ligands in solution with their partner receptor. A 
simplified lattice model posits a discrete set of sites (represented as boxes) that the ligands 
can occupy and permits a direct evaluation of the various microstates (ways of arranging 
the ligands) . The first three microstates shown here have the receptor unoccupied while the 
fourth microstate is one in which the receptor is occupied. 
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of the all-important variational statement of the second law of thermodynamics which tells 
us how to select out of all of the possible macrostates of a system, which state is most likely 
to be observed. In particular, for an isolated system (i.e. one that has rigid, adiabatic, 
impermeable walls, where no matter or energy can exit or enter the system) the equilibrium 
state is that macrostate that maximizes the entropy Stated differently, the macroscopically 
observed state will be that state which can be realized in the largest number of microscopic 
states. 

Now that we have the intuitive idea of microstates in hand and have shown how to 
enumerate them mathematically, and furthermore we have introduced Gibbs' calculus of 
equilibrium in the form of the variational statement of the second law of thermodynamics, 
we are prepared to introduce the Boltzmann distribution itself. The Boltzmann distribution 
derives naturally from the second law of thermodynamics as the distribution that maximizes 
the entropy of a system in contact with a thermal bath (for a detailed derivation refer to 
the appendix) . Statistical mechanics describes systems in terms of the probabilities of the 
various microstates. This style of reasoning is different from the familiar example offered 
by classical physics in disciplines such as mechanics and electricity and magnetism which 
centers on deterministic analysis of physical systems. By way of contrast, the way we do 
"physics" on systems that can exist in astronomical numbers of different microstates is to 
assign probabilities to them. 

As noted above, one useful analogy is with the handling of classical dynamics. All science 
students have at one time or another been taught Newton's second law of motion (F = ma) 
and usually, this governing equation is introduced axiomatically. There is a corresponding 
central equation in statistical mechanics which can also be introduced axiomatically. In 
particular, if we label the i th microstate by its energy Ei, then the probability of that 
microstate is given by 

Vi = \e~" E \ (4) 

where — l/kgT. The factor e~@ Ei is called the Boltzmann Factor and Z is the "partition 
function", which is defined as Z — '^2 i e~ l3Ei . Note that from this definition it follows 
that the probabilities are normalized, namely YliPi = 1. Intuitively, what this distribution 
tells us is that when we have a system that can exchange energy with its environment, the 
probability of that system being in a particular microstate decays exponentially with the 
energy of the microstates. Further, ksT sets the natural energy scale of physical biology 
where the temperature T for biological systems is usually around 300 K by telling us that 
microstates with energies too much larger than ksT w 4.1 pN nm w 0.6 kcal/mol ~ 
2.5 kJ/mol are thermally inaccessible. This first introduction to the Boltzmann distribution 
suffices to now begin to analyze problems of biological relevance. 



3 State Variables and States and Weights 

One way to breathe life into the Boltzmann distribution is by constructing a compelling and 
honest correspondence between biological cartoons and their statistical mechanical meaning. 
Many of the cartoons familiar from molecular biology textbooks are extremely information- 
rich representations of a wealth of biological data and understanding. These informative 
cartoons can often be readily adapted to a statistical mechanics analysis simply by assigning 
statistical weights to the different configurations as shown in fig. [3] This first example 
considers the probability of finding an ion channel in the open or closed state in a simple 
model in which it is assumed that the channel can exist in only two states. We use ion 
channels as our first example of states and weights because, in this way, we can appeal to 
one of the physicists favorite models, the two-level system, while at the same time using a 
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Figure 3: Ion channel open probability, (a) Current as a function of time for an ion channel 
showing repeated transitions between the open and closed states [10 . (b) States and weights 
for an ion channel. The cartoon shows a schematic of the channel states which have different 
energies, and by the Boltzmann distribution, different probabilities. 



biological example that is of central importance. During our course, we try to repeat this 
same basic motif of identifying in cartoon form the microscopic states of some biological 
problem and then to assign those different states and their corresponding statistical weights 
(and probabilities). 

As shown in fig. |3j in the simplest model of an ion channel we argue that there are two 
states: the closed state and the open state. In reality some ion channels have been shown 
to exist in more than two states. However, the two state approximation is still useful for 
understanding most aspects of their electrical conductance behavior as shown by the trace 
in fig. [3^, where only two states predominate. In order to make a mapping between the 
biological cartoon and the statistical weights, we need to know the energies of the closed 
and open states, e c i osc d and e pcn- Also, for many kinds of channels the difference in energy 
between closed and open can be tuned by the application of external variables such as an 
electric field (voltage-gated channels), membrane tension (mechanosensitive channels) and 
the binding of ligands (ligand-gated channels) [TT] . 

We find that a two-state ion channel is one of the cleanest and simplest examples for 
introducing how a statistical mechanics calculation might go. In addition, these ideas on 
ion channels serve as motivation for the introduction of a convenient way of characterizing 
the "state" of many macromolecules of biological interest. We define the two-state variables 
a which can take on either the values or 1 to signify the distinct conformation or state of 
binding of a given molecule. For example, in the ion channel problem a = corresponds 
to the closed state of the channel and a = 1 corresponds to the open state (this choice is 
arbitrary, we could equally have chosen to call er = 1 the closed state, but this choice makes 
more intuitive sense). As a result, we can write the energy of the ion channel as 

E(a) = (1 - Cr)e c l oscd + CrCopcn- (5) 

Thus, when the channel is closed, a — and the energy is e c iosod- Similarly, when a = 1, the 
channel is open and the energy is e opC n- Though this formalism may seem heavy handed, 
it is extremely convenient for generalizing to more complicated problems such as channels 
that can have a ligand bound or not as well being open or closed. Another useful feature 
of this simple notation is that it permits us to compute quantities of experimental interest 
straight away. Our aim is to compute the open probability P pen which, in terms of our 
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state variable a, can be written as (a), where (• • • ) denotes an average. When (a) w this 
means that the probability of finding the channel open is low. Similarly, when (a) ~ 1, this 
means that it is almost certain that we will find the channel open. The simplest way to 
think of this average is to imagine a total of N channels and then to evaluate the fraction 
of the channels that are in the open state, N open /N. 

To compute the probability that the channel will be open, we invoke the Boltzmann 
distribution and, in particular, we evaluate the partition function given by 

Z = E = e~^" d + e~^« . (6) 

<T = 

As noted above, for the simple two-state description of a channel, the partition function is a 
sum over only two states, the closed state and the open state. Given the partition function, 
we then know the probability of both the open and closed states via, p open = e _ ^ Copon /Z 
and ^closed = e" /3ecl ° BCd /Z. 

Using the partition function of eq. [6] we see that the open probability is given by (really, 
it is nothing more than p pcn) 

(<x)=£>p(<r)=Oxp(0) + lxp(l). (7) 

As a result, we see that the open probability is given by 

e ~^° pcn _ 1 

~~ e -/3«clo B od -)- g-/3eopon ~~ g/3(eopen-«closed) 4- I ' ^ ' 

This result illustrates more quantitatively the argument made earlier that the energy scale 
&bT is the standard that determines whether a given microstate is accessible. It also shows 
how in terms of energy what really matters is the relative difference between the different 
states (Ae = e op en — ^closed) and not their absolute values. An example of the probability of 
the channel being open is shown in fig. [4] for several different choices of applied voltage in 
the case where the voltage is used to tune the difference between e open and e c i ose d- In turn, 
measuring p opon experimentally tells you Ae between the two states. 

Often, when using statistical mechanics to analyze problems of biological interest, our 
aim is to characterize several features of a macromolecule at once. For example, for an ion 
channel, the microstates are described by several "axes" simultaneously. Is there a bound 
ligand? Is the channel open or not? Similarly, when thinking about post-translational 
modifications of proteins, we are often interested in the state of phosphorylation of the 
protein of interest |13U14j . But at the same time, we might wish to characterize the protein 
as being active or inactive, and also whether or not it is bound to allosteric activators or 
inhibitors. As with the ion channel, there are several variables needed to specify the overall 
state of the macromolecule of interest. Although this is admittedly oversimplified, countless 
biologically relevant problems can be approached by considering two main states (bound 
vs. unbound, active vs. inactive, phosphorylated vs. unphosphorylated, etc.) each of which 
can be characterized by its own state variable Oi. The key outcome of this first case study 
is that we have seen how both the Boltzmann distribution and the partition function are 
calculated in a simple example and have shown a slick and very useful way of representing 
biochemical states using simple two-state variables. 

4 Ligand- Receptor Binding as a Case Study 

The next step upward in complexity is to consider ligand-receptor interactions where we 
must keep track of two or more separate molecules rather than just one molecule in two 
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Figure 4: Probability of open and closed states for a voltage-sensitive ion channel, (a) 
Current traces for a sodium channel for four different applied voltages. The histograms to 
the right of each current trace show the fraction of time spent in the two states [T^] • (b) 
Probability that the channel is open as a function of the applied voltage. The data points 
correspond to computing the fraction of time the channel spends open for traces like those 
shown in (a). The curve shows a fit to the data using eq. [8j where /3(e opcn — Closed) = 
/fo(K PP iicd - Vb). The fit yields a ~ -0.096 fc B T/mV and V = -94 mV. 
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Figure 5: States and weights diagram for ligand-receptor binding. The cartoons show a 
lattice model of solution for the case in which there are L ligands and lattice sites. In 
the upper panel, the receptor is unoccupied. In the lower panel, the receptor is occupied 
by a ligand and the remaining L — 1 ligands are free in solution. A given state has a 
weight dictated by its Boltzmann factor. The multiplicity refers to the number of different 
microstates that share that same Boltzmann factor (for example, all of the states with no 
ligand bound to the receptor have the same Boltzmann factor) . The total statistical weight 
is given by the product of the multiplicity and the Boltzmann factor. 



different states. Examples of this kind of binding include: the binding of acetylcholine to the 
nicotinic acetylcholine receptor [13] , the binding of transcription factors to DNA |T5] , the 
binding of oxygen to hemoglobin, the binding of antigens to antibodies [T7| and so on. To 
examine the physics of fig. [2] imagine there are L ligand molecules in the box characterized 
by fl lattice sites as well as a single receptor with one binding site as shown. Our ambition 
is to compute the probability that the receptor will be occupied (pbound) as a function of 
the number (or concentration) of ligands. 

To see the logic of this calculation more clearly, fig. [5j shows the states available to this 
system as well as their Boltzmann factors, multiplicities (i.e. the number of different ways 
of arranging L or L — 1 ligands in solution) and overall statistical weights which are the 
products of the multiplicities and the Boltzmann factor. The key point is that there are 
only two classes of states: i) all of those states for which there is no ligand bound to the 
receptor and ii) all of those states for which one of the ligands is bound to the receptor. The 
useful feature of this situation is that although there are many realizations of each class of 
state, the Boltzmann factor for each of these individual realizations for each of the classes 
of state are all the same as shown in fig. [5] since all microstates in each class have the same 
energy. 

To compute the probability that the receptor is occupied, we need to construct a ratio in 
which the numerator involves the accumulated statistical weight of all states in which one 
ligand is bound to the receptor and the denominator is the sum over all states. This idea is 
represented graphically in fig. [6] What the figure shows is that there are a host of different 
states in which the receptor is occupied: first, there are L different ligands that can bind to 
the receptor, second, the L — 1 ligands that remain behind in solution can be distributed 
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amongst the f2 lattice sites in many different ways. In particular, we have 



weight when receptor occupied = e /3eb ° u " d x 



E 



,-/3(£-l)e BO 



(9) 



receptor solution 



free ligands 



where we have introduced ebound a s the energy for the ligand when bound to the receptor and 
^solution as the energy for a ligand in solution. The summation ^solution 1S an instruction 
to sum over all of the ways of arranging the L — 1 ligands on the £1 lattice sites in solution 
with each of those states assigned the weight e - '^ 1 ' -1 ' 630111 " 011 . Since the Boltzmann factor 
is the same for each of these states, what this sum amounts to is finding the number of 
arrangements of the L — 1 ligands amongst the f2 lattice sites and yields 



-/3(L-l)e aolution 



-/3(L-l)e solution _ 



solution 



(L -l)!(n- (£-!))!■ 



(10) 



To effect this sum, we have exploited precisely the same counting argument that led to eq. [2] 
with the only change that now we have L — 1 ligands rather than L. The denominator of 
the expression shown in fig. [6] is the partition function itself since it represents a sum over 
all possible arrangements of the system (both those with the receptor occupied and not) 
and is given by 



Z(L,il)= 



= -/3ie S olutio„ . 



-/3e b o 



E 



,-/3(L-l)e solutio 



(11) 



solution 



solution 



none bound 



ligand bound 



We already evaluated the second term in the sum culminating in cq. |10| To complete our 
evaluation of the partition function, we have to evaluate the sum ^solution e 



all of the ways of arranging the L ligands on the ft lattice sites with the result 

O! 



E 

solution 



e -/3Le B0 i ution _ e -/3Le ao i uti , 



L\(fl-L)\ 



over 



(12) 



In light of these results, the partition function can be written as 

SV. 



-/3Le so i utio 



g-/3ebound e -/3(£-l)e 8 olution 



We can now simplify this result by using the approximation that 



(L-l)l(O-(i-l))! 



(13) 



(14) 



which is justified as long as » L. To see why this is a good approximation consider the 
case when Q = 10 6 and L — 10 resulting in 



10 6 ! 



(10 6 - 10)! 



10 b ■ (10 b - 1) • (10 b - 2) • ... • (10 b - 9) ~ (10 b ) 



6\10 



(15) 



Note that the approximate result is the largest term in the sum which we obtain by multi- 
plying out all the terms in parentheses. We leave it to the reader to show that the correction 
is of the order 0.001 %, five orders of magnitude smaller than the leading term. 
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Figure 6: Probability of receptor occupancy. The figure shows how the probability of 
receptor occupancy can be written as a ratio of the weights of the favorable outcomes and 
the weights of all outcomes. In this case the numerator is the result of summing over the 
weights of all states in which the receptor is occupied. 



With these results in hand, we can now write pbound as 

p -/3e b ou»d O L -\ r -0(L-l)e an u, t .^ 

(L-iy nfi N, 

L\ ^ e {L-l)\ 

This result can be simplified by multiplying the top and bottom by J^-e l3Lesolution , resulting 
in 

e -/3e b ou„d e -/3(£-lKoi„tion L\ ^Le 3olution 
_ (L-iy. v n L C / 17 \ 

il_p-pLe go l Mion I p-pCbound " p-p(-L-l)e S olution -i^-pPLt solution 

L\ ^ c (£-1)1 ° 

We combine the two fractions in the previous equation and note that L\/{L — 1)! = i 
and that e -^bou„d e -/3(£-iHoiutio„ x e -/3£e S oiut,o„ = e -/3(fbou„d-e a oiutio„)^ Finally, we define 

the difference in energy between a bound ligand and a ligand that is free in solution as 
Ae = Ebound — ^solution- The probability of binding to the receptor becomes 

L e -/3Ae 

Pbound = \ p - 0A e- ^ 

The overall volume of the box is Vbox aid this permits us to rewrite our results using 
concentration variables. In particular, this can be written in terms of ligand concentration 
c = L/Vhox if we introduce cq = f2/Vbox, a "reference" concentration where every lattice 
position in the solution is occupied. The choice of reference concentration is arbitrary. For 
the purposes of fig.|7]we choose the elementary box size to be 1 nm 3 , resulting in Co ~ 0.6 M. 
This is comparable to the standard state used in many biochemistry textbooks of 1 M. The 
binding curve can be rewritten as 

^ e -/3Ae 

Pbound = 7 ,°_c_ _/3Ae ' ( 19 ) 

This classic result goes under many different names depending upon the field. In biochem- 
istry, this might be referred to as a Hill function with Hill coefficient one. Chemists and 
physicists might be more familiar with this result as the Langmuir adsorption isotherm 
which provides a measure of the surface coverage as a function of the partial pressure or 
the concentration. Regardless of names, this expression will be our point of departure for 



12 




Figure 7: Simple binding curve. This curve shows Abound as calculated in eq. 19 The 
different curves correspond to different choices for the strength of the ligand-receptor binding 
energy, Ae, given a reference concentration of c = 0.6 M. The corresponding dissociation 
constants are shown as well. 



thinking about all binding problems and an example of this kind of binding curve is shown 
in fig. [7J This reasoning can be applied to binding in real macromolecules of biological 
interest such as myoglobin as shown in fig. [8] Though many problems of biological interest 
exhibit binding curves that are "sharper" (i.e. there is a more rapid change in pbound with 
ligand concentration) than this one, ultimately, even those curves are measured against the 
standard result derived here. 

So far, we have examined binding from the perspective of statistical mechanics. That 
same problem can be addressed from the point of view of equilibrium constants and the 
law of mass action, and it is enlightening to examine the relation between the two points of 
view. To see the connection, the reaction of interest is characterized by the stoichiometric 
equation 

L + R^LR. (20) 
This reaction is described by a dissociation constant given by the law of mass action as 

Kd= jLRj- (21) 

It is convenient to rearrange this expression in terms of the concentration of ligand-receptor 
complexes as 

[LR] = ffl. (22) 

As before, our interest is in the probability that the receptor will be occupied by a ligand. 
In terms of the concentration of free receptors and ligand-receptor complexes, Pbound can be 
written as 

Pbound - [R] - [LR] ■ (23) 
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Oxygen partial pressure (mmHg) 



Figure 8: Example of ligand-receptor binding. The binding of oxygen to myoglobin as a 
function of the oxygen partial pressure. The points correspond to the measured occupancy 
of myoglobin as a function of the oxygen partial pressure [18] (because oxygen is a gas, 
partial pressure is used rather than concentration) and the curve is a fit based upon the 
one-parameter model from eq. |19| The fit yields Ae = — 4.9/cbT using a standard state 
cq = 760 mmHg = 1 atm, which also corresponds to a dissociation constant Kd = 5.6 mmHg. 



We are now poised to write Abound itself by invoking eq. |22[ with the result that 

mm h 

Pbound = —— d [L][B] = \ L] ■ (24) 

+ K d 1 + K d 

What we see is that Kd is naturally interpreted as that concentration of ligand at which the 
receptor has a probability of 1/2 of being occupied. 

We can relate our two descriptions of binding as embodied in eqns. [19] and [24] Indeed, 
these two equations permit us to go back and forth between the statistical mechanical and 
thermodynamic treatment of binding through the recognition that the dissociation constant 
can be written as Kd = coe' 3Ae . To see that, we note that both of these equations have the 
same functional form (pbound = + x)) allowing us to set [L]/Kd = ^" e_/3Ae an d noting 
that [L] = c by definition. This equation permits us to use measured equilibrium constants 
to determine microscopic parameters such as the binding energy as illustrated in figs. [7] and 

m 



5 Statistical Mechanics and Transcriptional Regulation 

Transcriptional regulation is at the heart of much of biology. With increasing regularity, 
the data that is being generated on transcriptional regulation is quantitative. In particular, 
it is possible to quantify how much a given gene is expressed, where within the organism 
and at what time. Of course, with the advent of such data, it is important that models of 
transcriptional regulation keep pace with the measurements themselves. The first step in 
the transcription process is the binding of RNA polymerase to its target DNA sequence at 
the start of a gene known as a promoter. From a statistical mechanics perspective, the so- 
called "thermodynamic models" of gene expression are founded upon the assumption that 
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Figure 9: RNA polymerase nonspecific reservoir. This figure represents DNA as a series 
of binding sites (schematized as boxes) for RNA polymerase. The number of nonspecific 
binding sites is N^g. 



the binding probability of RNA polymerase can be used as a surrogate for the extent of gene 
expression itself [THl HE1 121] • Here too, the use of equilibrium ideas must be justified 
as a result of separation of time scales such that the binding step can equilibrate before 
the process of transcription itself begins. The formulations derived above can be directly 
applied to the process of transcription by binding of RNA polymerase to DNA. The fact 
that DNA is an extended, linear polymer makes it seem at first blush like a very different 
type of binding problem than the protein-ligand interactions discussed above. Nevertheless, 
as we will show below, these same basic ideas are a natural starting point for the analysis 
of gene expression. 

The way we set up the problem is shown in fig. [9] First, we argue that the genome 
can be idealized as a "reservoir" of N^s nonspecific binding sites. We assume that RNA 
polymerase may bind with its footprint starting at any base pair in the entire genome, and 
that almost all of these possible binding sites are nonspecific ones. For the case of E. coli, 
there are roughly 500 — 2,000 polymerase molecules for a genome of around 5 x 10 6 base 
pairs [22 . Amongst these nonspecific sites, there is one particular site (the promoter for the 
gene of interest) that we are interested in considering. In particular, we want to know the 
probability that this specific site will be occupied. 

As noted above, the simplest model for RNA polymerase binding argues that the DNA 
can be viewed as Nns distinct boxes where we need to place P RNA polymerase molecules, 
only allowing one such molecule per site. This results in the partial partition function 
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characterizing the distribution of polymerase molecules on the nonspecific DNA as 



n Z"-PY. * . ■ (25) 

v v ' Boltzmann weight 

number of arrangements 

We will use the notation e^ d to characterize the binding energy of RNA polymerase to 
specific sites (promoters) and e^f to characterize the binding energy for nonspecific sites. 
(A note of caution is that this model is overly simplistic since there is a continuum of 
different binding energies for the nonspecific sites [23J.) We are now poised to write down 
the total partition function for this problem which broadly involves two classes of states: 
i) all P RNA polymerase molecules are bound nonspecifically (note the similarity to the 



partition function for ligand- receptor binding as shown in eq. Ill, ii) one of the polymerase 
molecules is bound to the promoter and the remaining P — 1 polymerase molecules are 
bound nonspecifically. Given these two classes of states, we can write the total partition 
function as 



Z(P,N NS )= Z NS (P,N NS ) +Z NS (P- l,N N s)e- p &. (26) 

v v ' " v ' 

empty promoter RNAP on promoter 

To find the probability that RNA polymerase is bound to the promoter of interest, we 
compute the ratio of the weights of the configurations for which the RNA polymerase is 
bound to its promoter to the weights associated with all configurations. This is presented 



schematically in fig. 10 and results in 



_ (P-l)!(jV ws -(P-l))! e e , 9r , 

Pbound - ppe „s 0(P _ 1)e NS 0eS • 

P\(N NS -P)\ e + (P-l)!(jVjv S -(P-l))! e e 

Although this equation looks extremely grotesque it is really just the same as eq. [19] and is 



illustrated in fig. 10 In order to develop intuition for this result, we need to simplify the 



equation by invoking the approximation nv^f^pTi — (Nns) P \ which holds if P -C N^s- 
Note that this same approximation was invoked earlier in our treatment of ligand-receptor 
binding. In light of this approximation, if we multiply the top and bottom of eq. [27] by 
\P\/ {Nns) \ e l3Pe p d , we can write our final expression for Pbound as 

Pbound(P,N NS ) = — — -g -^g- . (28) 

p s e 1 [ • p d ~ p d ' + 1 

Once again, it is the energy difference Ae that matters rather than the absolute value of any 
of the particular binding energies. Furthermore the difference between a strong promoter 
and a weak promoter can be considered as equivalent to a difference in Ae |16j . 

The problem of promoter occupancy becomes much more interesting when we acknowl- 
edge the fact that transcription factors and other proteins serve as molecular gatekeepers, 
letting RNA polymerase bind at the appropriate time and keeping it away from the pro- 
moter at others. Different individual transcription factors can either activate the promoter 
through favorable molecular contacts between the polymerase and the activator or they can 
stand in the way of polymerase binding (this is the case of repressors). An example of 
repression is shown in fig. |11| The same ideas introduced above can be exploited for exam- 
ining the competition between repressor and polymerase for two overlapping binding sites 
on the DNA. In this case, there are three classes of states to consider: i) empty promoter, 
ii) promoter occupied by RNA polymerase and iii) promoter occupied by repressor. In this 
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Figure 10: Probability of RNA polymerase binding. The probability of polymerase binding 
is constructed as a ratio of favorable outcomes (i.e. promoter occupied) to all possible out- 
comes. We are assuming that RNA polymerase is always bound to DNA, either specifically 
or nonspecifically. 



case, the total partition function is given by 
Z tot (P,R,N NS ) = Z(P,R,N NS ) +Z(P-l,R,N NS )e- <* + Z(P,R-l,N NS )e-P e r*, 

s v ' s * ' s v ' 

empty promoter RNAP on promoter repressor on promoter 

(29) 

where we have written the total partition function as a sum over partial partition functions 
which involve sums over certain restricted sets of states. Each Z is written as a function of P 
and R, the number of polymerases and repressors in the nonspecific reservoir, respectively. 
We have introduced e r d, which accounts for the energy of binding of the repressor to a 
specific site (efd) 01 t° a nonspecific site (e^ s ). For example, the term corresponding to the 
empty promoter can be written as 

The other two terms have the same form except that P goes to P — 1 or that R goes to 
R-l. 

Once we have identified the various competing states and their weights, we are in a 
position to ask key questions of biological interest. For example, what is the probability 
of promoter occupancy as a function of repressor concentration? The results worked out 
above now provide us with the tools in order to evaluate the probability that the promoter 
will be occupied by RNA polymerase. This probability is given by the ratio of the favorable 
outcomes to all of the outcomes. In mathematical terms, that is 

I p u at \ Z(P-\,R,N NS )e-^ 

Pbound\r, H, iV NS ) — — g — — . 

Z(P, R, N NS ) + Z(P — 1, R, N NS )e~^ + Z(P, R - l,N NS )e-^* 

(31) 

As argued above, this result can be rewritten in compact form by dividing top and bottom 
by Z(P — 1, R, Nws) e ~ pd and by invoking the approximation 

N NS l (N NS f (N NS f 

P\R\(N NS - P - R)\ ~ PI R\ 1 ' 

which amounts to the physical statement that there are so few polymerase and repres- 
sor molecules in comparison with the number of available sites, iVjvs, that each of these 
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Figure 11: States and weights for promoter in the presence of repressor. The promoter is 
labeled in dark yellow and the repressor binding site (operator) is labeled in brown. Notice 
the overlap between the promoter and the repressor binding site, which is denoted in green. 
The weights of these different states are a product of the multiplicity of the state of interest 
and the corresponding Boltzmann factor. 



molecules can more or less fully explore those Nns sites without feeling the presence of 
each other. The resulting probability is 

Pbou7id(P, R, N NS ) = -zT-s — t^st- — s • (33) 

Of course, a calculation like this is most interesting when it sheds light on some exper- 
imental measurement. In this case, we can appeal to measurements on one of the classic 
regulatory networks in biology, namely, the lac operon. The lac promoter controls genes that 
are responsible for lactose utilization by bacteria. When the operon is "on" , the bacterium 
produces the enzymes necessary to import and digest lactose. By way of contrast, when 
the operon is "off", this enzymes are lacking (or expressed at low "basal" levels). It has 
been possible to measure relative changes in the production of this enzyme as a function of 
both the number of repressor molecules and the strength of the repressor binding sites [H] . 
This relative change in gene expression is defined as the concentration of protein product 
in the presence of repressor divided by the concentration of protein product in the absence 
of it. In order to connect this type of data to the thermodynamic models we resort to one 
key assumption, namely that the level of gene expression is linearly related to Abound; the 
probability of finding RNA polymerase bound to the promoter. Once this assumption is 
made, we can compute the fold-change in gene expression as 

t , , , . Pbound(-R i= 0) 

fold-change m gene expression = -. (34) 

Pbound(-R = 0) 

After inserting eq. [33] in the expression for the fold-change we find 

1 + Mns. e P(4d-4^ 

fold-change in gene expression = — -= r^r- — — . (35) 

l + ^ e /3(sV^/)(l+ « e -^-^/)) 1 ' 
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Figure 12: Fold-change due to repression. Experimental data on the fold-change in gene 
expression as a function of repressor concentration and binding affinity to DNA |24j and 
corresponding determination of the binding strength of repressor to DNA given by the 
theoretical model. Experimentally, the different binding strengths are changed by varying 
the DNA sequence of the site where repressor binds. For each one of these DNA constructs 



only one parameter, the difference in binding energy Ae is obtained using eq. 36 



Finally, we note that in the case of a weak promoter such as the lac promoter, in vitro 
measurements suggest that the factor ^S-e l3 ^ e p d ~ e '" i ' is of the order of 500 [16]. This makes 
the second term in the numerator and denominator of eq. [35] much bigger than one. In this 
particular case of weak promoter the fold-change becomes independent of RNA polymerase 
as follows 

■ ^ e «sV</> / R s eNS 

fold-change m gene expression ~ — -= ^ — — =1-1 e ^ rd rd ' 

(36) 

For binding to a strong promoter, this approxima tion is no longer valid since the factor 

show the experimental data for 

different concentrations of repressor and values of 



EMs_ e l 3 (< L pd- e P d ) i s f order 3 in this case. In fig. ] 

its binding relative affinity to DNA, 
Ae r d = ~ e rd S ■ Overlaid with this data we plot the fold-change in gene expression given 
by eq. [36] where the binding strength to each type of site is determined. Notice that for each 
curve there is only one parameter to be determined, Ae rc i- This simple example shows how 
statistical mechanics arguments can be used to interpret measurements on gene expression 
in bacteria. 



6 Cooperativity in Binding: The Case Study of Hemoglobin 

So far, our treatment of binding has focused on simple binding reactions such as the myo- 
globin binding curve in fig. [8] which do not exhibit the sharpness seen in some biological 
examples. This sharpness in binding is often denoted as "cooperativity" and refers to the 
fact that in cases where several ligands bind simultaneously, the energy of binding is not 
additive. In particular, cooperativity refers to the fact that the binding energy for a given 
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ligand depends upon the number of ligands that are already bound to the receptor. Intu- 
itively, the cooperativity idea results from the fact that when a ligand binds to a protein, it 
will induce some conformational change. As a result, when the next ligand binds, it finds an 
altered protein interface and hence experiences a different binding energy (characterized by 
a different equilibrium constant). From the point of view of statistical mechanics, we will 
interpret cooperativity as an interaction energy - that is, the energy of the various ligand 
binding reaction are not simply additive. 

The classic example of this phenomenon is hemoglobin, the molecule responsible for 
carrying oxygen in the blood. This molecule has four distinct binding sites, reflecting its 
structure as a tetramer of four separate myoglobin-like polypeptide chains. Our treatment 
of ligand-receptor binding in the case of hemoglobin can be couched in the language of 
two-state occupation variables. In particular, for hemoglobin, we describe the state of the 
system with the four variables (01, 02, 03, C4), where cr* adopts the values (unbound) or 1 
(bound) characterizing the occupancy of site i within the molecule. One of the main goals 
of a model like this is to address questions such as the average number of bound oxygen 
molecules as a function of the oxygen concentration (or partial pressure). 

As a first foray into the problem of cooperative binding, we examine a toy model which 
reflects some of the full complexity of binding in hemoglobin. In particular, we imagine a 
fictitious dimoglobin molecule which has only two myoglobin-like polypeptide chains and 
therefore two O2 binding sites. We begin by identifying the states and weights as shown in 
fig. [13| This molecule is characterized by four distinct states corresponding to each of the 
binding sites of the dimoglobin molecule either being occupied or empty. For example, if 
binding site 1 is occupied we have o\ = 1 and if unoccupied then <j\ — 0. The energy of the 
system can be written as 

E = Ae((Ti + a 2 ) + J<ti<7 2 , (37) 
where Ae is the energy gain garnered by the molecules when bound to dimoglobin as opposed 
to wandering around in solution. The parameter J is a measure of the cooperativity and 
implies that when both sites are occupied, the energy is more than the sum of the individual 
binding energies. 

As we have done throughout the article, we can compute the probability of different 
states using the states and weights diagram by constructing a ratio with the numerator 
given by the weight of the state of interest and the denominator by the sum over all states. 



In analogy to eq. 13 we can write the partition function corresponding to the weights in 
fig. |13| Next we can calculate the probability of finding no oxygen molecule bound to 
our dimoglobin molecule, the probability of finding one oxygen molecule bound or that of 
finding two molecules bound. For example, if we want to compute the probability of single 



occupancy we can add up the weights corresponding to this outcome of fig. 13 and divide 
them by the total partition function which yields 



" ! -/3[(L-l)e so lutio„+ebou„d] 



Pi 



'(L-l) 



e 



^ p -/5ie 3 olutio„ +2 " L ~ 1 e~' 3 [( I -- 1 ) e °°i"tion+e bound ] , " L ~ 2 e -/3[(£-2)e solution +2 ebound + J] ' 



(38) 

Similarly to what was done in eq. [19] we can write the previous probability in terms of the 
standard state and multiply and divide by ^-e - ' 3Le50lution . This results in 

Pi = C ° , ,2 ■ ( 39 ) 

e -0Ae+J 



(ft)' 



In fig. 14 we plot this probability as a function of the oxygen partial pressure as well as po 
and P2, the probabilities of the dimoglobin molecule being empty and being occupied by 
two oxygen molecules, respectively. 
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Figure 13: States and weights for dimoglobin. The different states correspond to different 
occupancies of the two binding sites by oxygen molecules. 
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Figure 14: Probabilities of oxygen binding to dimoglobin. The plot shows the probability 
of finding no oxygen molecules bound to dimoglobing (pa), of finding one molecule bound 
(pi) and that of finding two molecules bound (p2)- The parameters used are Ae = —5 fc^T, 
J = -.25 k B T, and c = 760 mmHg. 
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Next, we calculate the average number of bound oxygen molecules as a function of its 
partial pressure. We add the number of molecules bound in each state times the probability 
of that state occurring 

2 _c_ e -/3A£ _|_ 2(_£_)2 e -/3(2A£+J) 

(Abound) =lX Pl +2xp 2 = i °° ^ C ° )2e _ 0{2Ae+J) ■ (40) 

Co ^ c ' 

To further probe the nature of cooperativity, a useful exercise is to examine the occupancy 
of the dimoglobin molecule in the case where the interaction term J is zero. We find that 
the average occupancy is given by the sum of two independent single-site occupancies as 

2^ e -/3A £ , 2CJM2 e -/32A e 2-^e _/3Ae fl + -^e-l 3 ^) jL e -P^e 
(N) = ^ = — ^ - '- = u\\ 

In considering the real hemoglobin molecule rather than the fictitious dimoglobin, the 
only novelty incurred is extra mathematical baggage. In this case, there are four state 
variables a\, 02, 03 and 04 that correspond to the state of oxygen occupancy at the four 
distinct sites on the hemoglobin molecule. There are various models that have been set 
forth for thinking about binding in hemoglobin, many of which can be couched simply in 
the language of these occupation variables. One important model which was introduced by 
Adair in 1925 25J assigns distinct interaction energies for the binding of the second, third 
and fourth oxygen molecules. The energy in this model is written as 

44 4 4 

E = AeJ2° a + J 

(JuU-t + K ^ a a apa y + L ^ a a a/3a 7 as, (42) 

where the parameters K and L capture the energy of the three- and four-body interactions, 
respectively. This model results in a very sharp response compared to a simple binding 
curve without any cooperativity. In fig. [15] we show the corresponding states for this model 
as well as the fit to the experimental data. This curve should be contrasted with the fit 



curve using the simple binding model such as eq. 19 



7 Conclusions 

We have argued that statistical mechanics needs to be part of the standard toolkit of bi- 
ologists who wish to understand the biochemical underpinnings of their discipline. Indeed, 
the ideas in this paper represent a small part of a more general text on physical biology 
entitled "Physical Biology of the Cell" worked on by all of us which aims not only to pro- 
vide the quantitative underpinnings offered by statistical mechanics, but a range of other 
tools that are useful for the quantitative analysis of living matter. Our own experiments 
in teaching such material convince us that a first exposure to statistical mechanics can be 
built around a careful introduction of the concept of a microstate and the assertion of the 
Boltzmann distribution as the fundamental "law" of statistical mechanics. These formal 
ideas in conjunction with simple, approximate models such as the lattice model of solution 
and two-state models for molecular conformations permit an analysis of a large number of 
different interesting problems. 

We challenge the notion that biologists need to understand every detail of statistical 
mechanics in order to use it fruitfully in their thinking and research. The importance of 
this dictum was stated eloquently by Schawlow who noted "To do successful research, you 
don't need to know everything. You just need to know of one thing that isn't known." To 
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Figure 15: Binding of oxygen by hemoglobin, (a) States considered in the Adair model 
and (b) plot of experimental data together with the data fit to the four parameters of 
the Adair model and to a simple no-cooperativity case. Note that the shape of the curve 
without cooperativity looks different from that in fig. [8] only because we plot it here using 
a log- scale. 



successfully apply statistical mechanics, we argue that a feeling for microstates and how to 
assign them probabilities will go a long way toward demystifying statistical mechanics and 
permit biology students to think about many new problems. 

With the foundations described in this paper in hand, our courses turn to a variety of 
other interesting applications of statistical mechanics that include: accessibility of nucleo- 
somal DNA, force-extension curves for DNA, lattice models of protein folding, the origins 
of the Hill coefficient, the role of tethering effects in biochemistry (what we like to call 
"biochemistry on a leash"), the Poisson-Boltzmann equation, the analysis of polymeriza- 
tion and molecular motors and many more. Though it is easy to make blanket statements 
about living systems being far from equilibrium, the calculus of equilibrium as embodied in 
statistical mechanics still turns out to be an exceedingly useful tool for the study of living 
systems and the molecules that make them work. 

Appendix: A derivation of the Boltzman distribution 

The setup we consider for our derivation of the probability of microstates for systems in 
contact with a thermal reservoir is shown in fig. |16[ The idea is that we have a box which is 
separated from the rest of the world by rigid, impermeable and adiabatic walls. As a result, 
the total energy and the total number of particles within the box are constant. Inside this 
box, we now consider two regions, one that is our system of interest and the other of which 
is the reservoir. We are interested in how the system and the reservoir share their energy. 

The total energy is E to t = E r + E s where the subscripts r and s signify reservoir and 
system, respectively. Our fundamental assertion is that the probability of finding a given 
state of the system E s is proportional to the number of states available to the reservoir 
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Figure 16: System in contact with a heat bath (thermal reservoir). The system and its 
reservoir are completely isolated from the rest of the world by walls that are adiabatic 
(forbid the flow of heat out or in), rigid and impermeable (forbid the flow of matter). 
Energy can flow across the wall separating the system from the reservoir and as a result, 
the energy of the system (and reservoir) fluctuate. 
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when the system is in this state. That is 



= W r {E tot -E^) 
pf ] W r (E tot - Ei 2) y 

where W r (E tot — E^) is the number of states available to the reservoir, when the system 
is in the particular state E^\ By constructing the ratio of the probabilities, we avoid ever 
having to compute the absolute number of states available to the system. The logic is that 
we assert that the system is in one particular microstate that is characterized by its energy 
E s . When the system is assigned this energy, the reservoir has available a particular number 
of states W r (E to t — E s ) which depends upon how much energy, E to t ~ E s , it has. Though 
the equations may seem cumbersome, in fact, it is the underlying conceptual idea that is the 
subtle (and beautiful) part of the argument. The point is that the total number of states 
available to the universe of system plus reservoir when the system is in the particular state 
E^ is given by 

W tot {E tot -E^)= ^ x Wr{Etot -E?>) (44) 

states of system states of reservoir 

because we have asserted that the system itself is in one particular microstate which has 
energy E),. Though there may be other microstates with the same energy, we have selected 
one particular microstate of that energy and ask for its probability. 

We now ask what the relative probability between two states is. Basically, instead of 
counting the number of microstates available to a particular states we calculate the relative 
difference in microstates available to two given states (1) and (2). This will allow us to 
calculate the probabilities of each macrostate up to a multiplicative factor. As we saw in 
the text, this multiplicative factor will be given by the partition function Z. We can then 
rewrite eq. [43] as 

W r (E tot -EP) = e SAE tot -E^),k B 

W r {E tot -E^) e^o^FW 

where we have invoked the familiar Boltzmann equation for the entropy, namely S = fc^ln W 
which can be rewritten as W = e s l kB . To complete the derivation, we now note that 
E s << E to t- As a result, we can expand the entropy as 

dS r 

S r (Etot — E s ) sa S r (Etot) — -Qj^{Etot ~ Es), (46) 

where we have only kept terms that are first order in the differences. Finally, if we recall 
the thermodynamic identity (dS/dE) = l/T, we can write our result as 



(47) 



p W ~ e -E^/k B T 

(i) 

The resulting probability for finding the system in state i with energy E$ is 

P ( s t] = — , (48) 
precisely the Boltzmann distribution introduced earlier. 
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